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1  Introduction 

In  the  TACITUS  project  for  using  commonsense  knowledge  in  the  under¬ 
standing  of  texts  about  mechanical  devices  and  their  failures,  we  have  been 
developing  various  commonsense  theories  that  are  needed  to  mediate  be¬ 
tween  the  way  we  talk  about  the  behavior  of  such  devices  and  causal  models 
of  their  operation.  Of  central  importance  in  this  effort  is  the  axiomatiza- 
tion  of  what  might  be  called  “commonsense  metaphysics” .  This  includes  a 
number  of  areas  that  figure  in  virtually  every  domain  of  discourse,  such  as 
scalar  notions,  granularity,  time,  space,  material,  physical  objects,  causal¬ 
ity,  functionality,  force,  and  shape.  Our  approach  to  lexical  semantics  is 
to  construct  core  theories  of  each  of  these  areas,  and  then  to  define,  or  at 
least  characterize,  a  large  number  of  lexical  items  in  terms  provided  by  the 
core  theories.  In  the  TACITUS  system,  processes  for  solving  pragmatics 
problems  posed  by  a  text  will  use  the  knowledge  base  consisting  of  these 
theories,  in  conjunction  with  the  logical  forms  of  the  sentences  in  the  text, 
to  produce  an  interpretation.  In  this  paper  we  do  not  stress  these  interpre¬ 
tation  processes;  this  is  another,  important  aspect  of  the  TACITUS  project, 
and  it  will  be  described  in  subsequent  papers. 

This  work  represents  a  convergence  of  research  in  lexical  semantics  in 
linguistics  and  efforts  in  AI  to  encode  commonsense  knowledge.  Over  the 
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years,  lexical  semanticists  have  developed  formalisms  of  increasing  ade¬ 
quacy  for  encoding  word  meaning,  progressing  from  simple  sets  of  fea¬ 
tures  (Katz  and  Fodor,  1963)  to  notations  for  predicate-argument  structure 
(LakofF,  1972;  Miller  and  Johnson-Laird,  1976),  but  the  early  attempts  still 
limited  access  to  world  knowledge  and  assumed  only  very  restricted  sorts 
of  processing.  Workers  in  computational  linguistics  introduced  inference 
(Rieger,  1974;  Schank,  1975)  and  other  complex  cognitive  processes  (Her- 
skovits,  1982)  into  our  understanding  of  the  role  of  word  meaning.  Recently 
linguists  have  given  greater  attention  to  the  cognitive  processes  that  would 
operate  on  their  representations  (e.g.,  Talmy,  1983;  Croft,  1986).  Inde¬ 
pendently,  in  AI  an  effort  arose  to  encode  large  amounts  of  commonsense 
knowledge  (Hayes,  1979;  Hobbs  and  Moore,  1985;  Hobbs  et  al.  1985).  The 
research  reported  here  represents  a  convergence  of  these  various  develop¬ 
ments.  By  developing  core  theories  of  certain  fundamental  phenomena  and 
defining  lexical  items  within  these  theories,  using  the  full  power  of  predi¬ 
cate  calculus,  we  axe  able  to  cope  with  complexities  of  word  meaning  that 
have  hitherto  escaped  lexical  semanticists.  Moreover,  we  can  do  this  within 
a  framework  that  gives  full  scope  to  the  planning  and  reasoning  processes 
that  manipulate  representations  of  word  meaning. 

In  constructing  the  core  theories  we  are  attempting  to  adhere  to  several 
methodological  principles: 

1.  One  should  aim  for  characterization  of  concepts,  rather  than  defini¬ 
tion.  One  cannot  generally  expect  to  find  necessary  and  sufficient  conditions 
for  a  concept.  The  most  we  can  hope  for  is  to  find  a  number  of  necessary 
conditions  and  a  number  of  sufficient  conditions.  This  amounts  to  saying 
that  a  great  many  predicates  are  primitives,  but  they  are  primitives  that 
are  highly  interrelated  with  the  rest  of  the  knowledge  base. 

2.  One  should  determine  the  minimal  structure  necessary  for  a  concept 
to  make  sense.  In  efforts  to  axiomatize  an  area,  there  are  two  positions  one 
may  take,  exemplified  by  set  theory  and  by  group  theory.  In  axiomatiz- 
ing  set  theory,  one  attempts  to  capture  exactly  some  concept  that  one  has 
strong  intuitions  about.  If  the  axiomatization  turns  out  to  have  unexpected 
models,  this  exposes  an  inadequacy.  In  group  theory,  by  contrast,  one  char¬ 
acterizes  an  abstract  class  of  structures.  If  it  turns  out  that  there  are  un¬ 
expected  models,  this  is  a  serendipitous  discovery  of  a  new  phenomenon 
that  we  can  reason  about  using  an  old  theory.  The  pervasive  character 
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of  metaphor  in  natural  language  discourse  shows  that  our  commonsense 
theories  of  the  world  ought  to  be  much  more  like  group  theory  than  set 
theory.  By  seeking  minimal  structures  in  axiomatizing  concepts,  we  opti¬ 
mize  the  possibilities  of  using  the  theories  in  metaphorical  and  analogical 
contexts.  This  principle  is  illustrated  below  in  the  section  on  regions.  One 
consequence  of  this  principle  is  that  our  approach  will  seem  more  syntac¬ 
tic  than  semantic.  We  have  concentrated  more  on  specifying  axioms  than 
on  constructing  models.  Our  view  is  that  the  chief  role  of  models  in  our 
effort  is  for  proving  the  consistency  and  independence  of  sets  of  axioms, 
and  for  showing  their  adequacy.  As  an  example  of  the  last  point,  many 
of  the  spatial  and  temporal  theories  we  construct  are  intended  at  least  to 
have  Euclidean  space  or  the  real  numbers  as  one  model,  and  a  subclass  of 
graph-theoretical  structures  as  other  models. 

3.  A  balance  must  be  struck  between  attempting  to  cover  all  cases  and 
aiming  only  for  the  prototypical  cases.  In  general,  we  have  tried  to  cover  as 
many  cases  as  possible  with  an  elegant  axiomatization,  in  line  with  the  two 
previous  principles,  but  where  the  formalization  begins  to  look  baroque, 
we  assume  that  higher  processes  will  block  some  inferences  in  the  marginal 
cases.  We  assume  that  inferences  will  be  drawn  in  a  controlled  fashion. 
Thus,  every  outre,  highly  context-dependent  counterexample  need  not  be 
accounted  for,  and  to  a  certain  extent,  definitions  can  be  geared  specifically 
to  a  prototype. 

4.  Where  competing  ontologies  suggest  themselves  in  a  domain,  one 
should  try  to  construct  a  theory  that  accommodates  both.  Rather  than 
commit  oneself  to  adopting  one  set  of  primitives  rather  than  another,  one 
should  show  how  either  set  can  be  characterized  in  terms  of  the  other. 
Generally,  each  of  the  ontologies  is  useful  for  different  purposes,  and  it  is 
convenient  to  be  able  to  appeal  to  both.  Our  treatment  of  time  illustrates 
this. 

5.  The  theories  one  constructs  should  be  richer  in  axioms  than  in  theo¬ 
rems.  In  mathematics,  one  expects  to  state  half  a  dozen  axioms  and  prove 
dozens  of  theorems  from  them.  In  encoding  commonsense  knowledge,  it 
seems  to  be  just  the  opposite.  The  theorems  we  seek  to  prove  on  the  basis 
of  these  axioms  are  theorems  about  specific  situations  that  are  to  be  inter¬ 
preted,  in  particular,  theorems  about  a  text  that  the  system  is  attempting 
to  understand. 
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6.  One  should  avoid  falling  into  “black  holes” .  There  are  a  few  “mysteri¬ 
ous”  concepts  that  crop  up  repeatedly  in  the  formalization  of  commonsense 
metaphysics.  Among  these  are  “relevant”  (that  is,  relevant  to  the  task  at 
hand)  and  “normative”  (that  is,  conforming  to  some  norm  or  pattern).  To 
insist  upon  giving  a  satisfactory  analysis  of  these  before  using  them  in  an¬ 
alyzing  other  concepts  is  to  cross  the  event  horizon  that  separates  lexical 
semantics  from  philosophy.  On  the  other  hand,  our  experience  suggests  that 
to  avoid  their  use  entirely  is  crippling;  the  lexical  semantics  of  a  wide  vari¬ 
ety  of  other  terms  depends  upon  them.  Instead,  we  have  decided  to  leave 
them  minimally  analyzed  for  the  moment  and  use  them  without  scruple  in 
the  analysis  of  other  commonsense  concepts.  This  approach  will  allow  us  to 
accumulate  many  examples  of  the  use  of  these  mysterious  concepts,  and  in 
the  end,  contribute  to  their  successful  analysis.  The  use  of  these  concepts 
appears  below  in  the  discussions  of  the  words  “immediately”,  “sample”, 
and  “operate”. 

We  chose  as  an  initial  target  the  problem  of  encoding  the  commonsense 
knowledge  that  underlies  the  concept  of  “wear”,  as  in  a  part  of  a  device 
wearing  out.  Our  aim  was  to  define  “wear”  in  terms  of  predicates  char¬ 
acterized  elsewhere  in  the  knowledge  base  and  to  be  able  to  infer  some 
consequences  of  wear.  For  something  to  wear,  we  decided,  is  for  it  to  lose 
imperceptible  bits  of  material  from  its  surface  due  to  abrasive  action  over 
time.  One  goal,  which  we  have  not  yet  achieved,  is  to  be  able  to  prove  as 
a  theorem  that,  since  the  shape  of  a  part  of  a  mechanical  device  is  often 
functional  and  since  loss  of  material  can  result  in  a  change  of  shape,  wear 
of  a  part  of  a  device  can  cause  the  failure  of  the  device  as  a  whole.  In 
addition,  as  we  have  proceeded,  we  have  characterized  a  number  of  words 
found  in  a  set  of  target  texts,  as  it  has  become  possible. 

We  are  encoding  the  knowledge  as  axioms  in  what  is  for  the  most  part  a 
first-order  logic,  described  by  Hobbs  (1985a),  although  quantification  over 
predicates  is  sometimes  convenient.  In  the  formalism  there  is  a  nominal- 
ization  operator  “  1  ”  for  reifying  events  and  conditions,  as  expressed  in  the 
following  axiom  schema: 

(Vz)p(z)  =  (3e)p'(e,z)  A  Ezist(e) 

That  is,  p  is  true  of  x  if  and  only  if  there  is  a  condition  e  of  p’s  being  true 
of  x  and  e  exists  in  the  real  world. 
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In  our  implementation  so  far,  we  have  been  proving  simple  theorems 
from  our  axioms  using  the  CG5  theorem-prover  developed  by  Mark  Stickel 
(1982),  but  we  are  only  now  beginning  to  use  the  knowledge  base  in  text 
processing. 

2  Requirements  on  Arguments  of  Predicates 

There  is  a  notational  convention  used  below  that  deserves  some  explanation. 
It  has  frequently  been  noted  that  relational  words  in  natural  language  can 
take  only  certain  types  of  words  as  their  arguments.  These  are  usually 
described  as  selectional  constraints.  The  same  is  true  of  predicates  in  our 
knowledge  base.  They  are  expressed  below  by  rules  of  the  form 

p(x,y)  :  r(x,y) 

This  means  that  for  p  even  to  make  sense  applied  to  x  and  y,  it  must  be 
the  case  that  r  is  true  of  x  and  y.  The  logical  import  of  this  rule  is  that 
wherever  there  is  an  axiom  of  the  form 

v 

(Vz,y)p(z,y)  D  q{x,y) 
this  is  really  to  be  read  as 

(Vx,y)p(x,y)  A  r(x,y)  D  q{x,y ) 

The  checking  of  selectional  constraints,  therefore,  emerges  as  a  by-product 
of  other  logical  operations:  the  constraint  r(x,y)  must  be  verified  if  any¬ 
thing  else  is  to  be  proved  from  p(z,y). 

The  simplest  example  of  such  an  r(z,y)  is  a  conjunction  of  sort  con¬ 
straints  ri(x)  A  r2(y).  Our  approach  is  a  generalization  of  this,  because 
much  more  complex  requirements  can  be  placed  on  the  arguments.  Con¬ 
sider,  for  example,  the  verb  “range”.  If  x  ranges  from  y  to  z ,  there  must 
be  a  scale  s  that  includes  y  and  z ,  and  x  must  be  a  set  of  entities  that  are 
located  at  various  places  on  the  scale.  This  can  be  represented  as  follows: 

range(xty,z)  :  (3s)sca/e(s)  A  y6s 
AzEs  A  sef(z) 

A  (Vu)[u  G  x  D  (3v)v  G  s  A  af(u,v)] 
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3  The  Knowledge  Base 

3.1  Sets  and  Granularity 

At  the  foundation  of  the  knowledge  base  is  an  axiomatization  of  set  theory. 
It  follows  the  standard  Zermelo-Frankel  approach,  except  that  there  is  no 
axiom  of  infinity. 

Since  so  many  concepts  used  in  discourse  are  grain-dependent,  a  theory 
of  granularity  is  also  fundamental  (see  Hobbs  1985b).  A  grain  is  defined  in 
terms  of  an  indistinguishability  relation,  which  is  reflexive  and  symmetric, 
but  not  necessarily  transitive.  One  grain  can  be  a  refinement  of  another, 
with  the  obvious  definition.  The  most  refined  grain  is  the  identity  grain, 
i.e.,  the  one  in  which  every  two  distinct  elements  are  distinguishable.  One 
possible  relationship  between  two  grains,  one  of  which  is  a  refinement  of  the 
other,  is  what  we  call  an  “Archimedean  relation”,  after  the  Archimedean 
property  of  real  numbers.  Intuitively,  if  enough  events  occur  that  are  im¬ 
perceptible  at  the  coarser  grain  g 2  but  perceptible  at  the  finer  grain  <71,  the 
aggregate  will  eventually  be  perceptible  at  the  coarser  grain.  This  is  an 
important  property  in  phenomena  subject  to  the  heap  paradox.  Wear,  for 
instance,  eventually  has  significant  consequences. 

3.2  Scales 

A  great  many  of  the  most  common  words  in  English  have  scales  as  their 
subject  matter.  This  includes  many  prepositions,  the  most  common  ad¬ 
verbs,  comparatives,  and  many  abstract  verbs.  When  spatial  vocabulary 
is  used  metaphorically,  it  is  generally  the  scalar  aspect  of  space  that  car¬ 
ries  over  to  the  target  domain.  A  scale  is  defined  as  a  set  of  elements, 
together  with  a  partial  ordering  and  a  granularity  (or  an  indistinguishabil¬ 
ity  relation).  The  partial  ordering  and  the  indistinguishability  relation  are 
consistent  with  each  other: 

(Vz,y,2)z  <y  D  x  <  z  V  x  ~  z 

It  is  useful  to  have  an  adjacency  relation  between  points  on  a  scale,  and 
there  are  a  number  of  ways  we  could  introduce  it.  We  could  simply  take  it 
to  be  primitive;  in  a  scale  having  a  distance  function,  we  could  define  two 
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points  to  be  adjacent  when  the  distance  between  them  is  less  than  some  e; 
finally,  we  could  define  adjacency  in  terms  of  the  grain  size: 

(y  x,y,s)adj(x,y,s)  = 

(3  z)z  ~  x  A  z  ~  y  A  -i[:r  ~  y], 

Two  important  possible  properties  of  scales  are  connectedness  and  dense¬ 
ness.  We  can  say  that  two  elements  of  a  scale  are  connected  by  a  chain  of 
adj  relations: 

(Va:,y,s)con.n.ected(a:,y,s)  = 
adj{x,y,s )  V 

(3  z)adj(x,  z,s)  A  connected(z,y,s) 

A  scale  is  connected  ( sconnected )  if  all  pairs  of  elements  are  connected.  A 
scale  is  dense  if  between  any  two  points  there  is  a  third  point,  until  the 
two  points  are  so  close  together  that  the  grain  size  no  longer  allows  us 
to  determine  whether  such  an  intermediate  point  exists.  Cranking  up  the 
magnification  could  well  resolve  the  continuous  space  into  a  discrete  set,  as 
objects  into  atoms. 

(Vs)dense(s)  = 

(Vx,y,  <)x  E  s  A  y  E  s  A  order(<,s )  A  x  <  y 
D  (Bz)(x  <  z  A  z  <  y) 

V  (3  z)(i  ~  2  A  z  ~  y) 

This  expresses  the  commonsense  notion  of  continuity. 

A  subscale  of  a  scale  has  as  its  elements  a  subset  of  the  elements  of  the 
scale  and  has  as  its  partial  ordering  and  its  grain  the  partial  ordering  and 
the  grain  of  the  scale. 

(Vsi,<,  ~) order (<,  Si)  A  yram(~,Si) 

D  (Vs2)[-s«fc5ca/e(s255i) 

=  su6set(s2,^i)  A  order(<,s2)  A  yrain(~,s2)] 

An  interval  can  be  defined  as  a  connected  subscale: 

(Vi)mteruai(:)  =  (3  s)scaie(s) 

Asubscale(i,s)  A  sconnected(i) 
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The  relations  between  time  intervals  that  Allen  and  Kautz  (1985)  have 
defined  can  be  defined  in  a  straightforward  manner  in  the  approach  pre¬ 
sented  here,  but  for  intervals  in  general. 

A  concept  closely  related  to  scales  is  that  of  a  “cycle” .  This  is  a  system 
that  has  a  natural  ordering  locally  but  contains  a  loop  globally.  Examples 
are  the  color  wheel,  clock  times,  and  geographical  locations  ordered  by  “east 
of”.  We  have  axiomatized  cycles  in  terms  of  a  ternary  between  relation 
whose  axioms  parallel  those  for  a  partial  ordering. 

The  figure-ground  relationship  is  of  fundamental  importance  in  lan¬ 
guage.  We  encode  it  with  the  primitive  predicate  at.  It  is  possible  that 
the  minimal  structure  necessary  for  something  to  be  a  ground  is  that  of  a 
scale;  hence,  this  is  a  selectional  constraint  on  the  arguments  of  at. 

at(x,y)  :  (3s)y  E  s  A  sca/e(s) 

At  this  point,  we  are  already  in  a  position  to  define  some  fairly  complex 
words.  As  an  illustration,  we  give  the  example  of  “range”  as  in  ux  ranges 
from  y  to  z” : 

[Vx,y,z)range(x,y,z)  = 

(3  s,  si,  Ui,  U2)scale(s)  A  subscale(si,s) 

Abottom(y,s\)  A  top(z,  Si) 

AuiGi  A  at(u i,  y) 

A«2€i  A  at(u2,z) 

A  (Vu)[u  E  x  D  (3v)u  E  si  A  at(u,v)] 

A  very  important  scale  is  the  linearly  ordered  scale  of  numbers.  We 
do  not  plan  to  reason  axiomatically  about  numbers,  but  it  is  useful  in 
natural  language  processing  to  have  encoded  a  few  facts  about  numbers. 
For  example,  a  set  has  a  cardinality  which  is  an  element  of  the  number 
scale. 

Verticality  is  a  concept  that  would  most  properly  be  analyzed  in  the 
section  on  space,  but  it  is  a  property  that  many  other  scales  have  acquired 
metaphorically,  for  whatever  reason.  The  number  scale  is  one  of  these. 
Even  in  the  absence  of  an  analysis  of  verticality,  it  is  a  useful  property  to 
have  as  a  primitive  in  lexical  semantics. 

The  word  “high”  is  a  vague  term  asserting  that  an  entity  is  in  the 
upper  region  of  some  scale.  It  requires  that  the  scale  be  a  vertical  one, 
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such  as  the  number  scale.  The  verticality  requirement  distinguishes  “high” 
from  the  more  general  term  “very” ;  we  can  say  “very  hard”  but  not  “highly 
hard” .  The  phrase  “highly  planar”  sounds  all  right  because  the  high  register 
of  “planar”  suggests  a  quantifiable,  scientific  accuracy,  whereas  the  low 
register  of  “flat”  makes  “highly  flat”  sound  much  worse. 

The  test  of  any  definition  is  whether  it  allows  one  to  draw  the  appropri¬ 
ate  inferences.  In  our  target  texts,  the  phrase  “high  usage”  occurs.  Usage  is 
a  set  of  using  events,  and  the  verticality  requirement  on  “high”  forces  us  to 
coerce  the  phrase  into  “a  high  or  large  number  of  using  events”.  Combin¬ 
ing  this  with  an  axiom  stating  that  the  use  of  a  mechanical  device  involves 
the  likelihood  of  abrasive  events,  as  defined  below,  and  with  the  definition 
of  “wear”  in  terms  of  abrasive  events,  we  should  be  able  to  conclude  the 
likelihood  of  wear. 

3.3  Time:  Two  Ontologies 

There  are  two  possible  ontologies  for  time.  In  the  first,  the  one  most  ac¬ 
ceptable  to  the  mathematically  minded,  there  is  a  time  line,  which  is  a 
scale  having  some  topological  structure.  We  can  stipulate  the  time  line  to 
be  linearly  ordered  (although  it  is  not  in  approaches  that  build  ignorance 
of  relative  times  into  the  representation  of  time  (e.g.,  Hobbs,  1974)  nor  in 
approaches  employing  branching  futures  (e.g.,  McDermott,  1985)),  and  we 
can  stipulate  it  to  be  dense  (although  it  is  not  in  the  situation  calculus). 
We  take  before  to  be  the  ordering  on  the  time  line: 

(Vti,t2)6e/ore(t1,t2)  = 

(3  T,  <)Time-line[T)  A  order {<,T) 

A  ti  £  T  A  tj  £  T  A  <  t2 

We  allow  both  instants  and  intervals  of  time.  Most  events  occur  at  some 
instant  or  during  some  interval.  In  this  approach,  nearly  every  predicate 
takes  a  time  argument. 

In  the  second  ontology,  the  one  that  seems  to  be  more  deeply  rooted  in 
language,  the  world  consists  of  a  large  number  of  more  or  less  independent 
processes,  or  histories,  or  sequences  of  events.  There  is  a  primitive  relation 
change  between  conditions.  Thus, 

change[ei,  e2)  A  p'{eux)  A  g'{e2,x) 
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says  that  there  is  a  change  from  the  condition  t\  of  p’s  being  true  of  x  to 
the  condition  e2  of  g's  being  true  of  x. 

The  time  line  in  this  ontology  is  then  an  artificial  construct,  a  regular 
sequence  of  imagined  abstract  events  (think  of  them  as  ticks  of  a  clock  in 
the  National  Bureau  of  Standards)  to  which  other  events  can  be  related. 
The  change  ontology  seems  to  correspond  to  the  way  we  experience  the 
world.  We  recognize  relations  of  causality,  change  of  state,  and  copresence 
among  events  and  conditions.  When  events  are  not  related  in  these  ways, 
judgments  of  relative  time  must  be  mediated  by  copresence  relations  be¬ 
tween  the  events  and  events  on  a  clock  and  change  of  state  relations  on  the 
clock. 

The  predicate  change  possesses  a  limited  transitivity.  There  has  been 
a  change  from  Reagan’s  being  an  actor  to  Reagan’s  being  president,  even 
though  he  was  governor  in  between.  But  we  probably  do  not  want  to 
say  there  has  been  a  change  from  Reagan’s  being  an  actor  to  Margaret 
Thatcher’s  being  prime  minister,  even  though  the  second  event  comes  after 
the  first. 

In  this  ontology,  we  can  say  that  any  two  times,  viewed  as  events,  always 
have  a  change  relation  between  them. 

(V , f 2 )before{tittt)  D  change{t\,t2) 

The  predicate  change  is  related  to  before  by  the  axiom 

e1,e2)change{eue2)  Z) 

(3 ti,t2)at(e1,t1) 

A  a t(e2,t2)  A  before(ti,t2) 

This  does  not  allow  us  to  derive  change  of  state  from  temporal  succession. 
For  this,  we  need  axioms  of  the  form 

(Vei,e2,t1,«2,x)p,(fi1,x)  A  at(ei,fi) 

Ag'[e2,x)  A  af(e2,t2)  A  before{ti,t2) 

Z>  change(ei,e2) 

That  is,  if  x  is  p  at  time  f*  and  g  at  a  later  time  t2,  then  there  has  been  a 
change  of  state  from  one  to  the  other.  Time  arguments  in  predications  can 
be  viewed  as  abbreviations: 
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(V x,t)p(x,t)  =  (3e)p'(e,i)  A  at(e,t) 

The  word  “move”,  or  the  predicate  move,  (as  in  “x  moves  from  y  to  zv) 
can  then  be  defined  equivalently  in  terms  of  change, 

(V x,  y,  z)move(x,  y,z)  = 

(3ei,e2)change(e1,e2) 

Aaf'(ei,x,y)  A  at'(e2,x,z ) 

or  in  terms  of  the  time  line, 

(V x,  y,  z)move(x,  y,  z)  = 

(3f1,t2)at(x,y,ti)  A  at[x,z,t2 )  A  6e/ore(t1}t2) 

In  English  and  apparently  all  other  natural  languages,  both  ontologies 
are  represented  in  the  lexicon.  The  time  line  ontology  is  found  in  clock  and 
calendar  terms,  tense  systems  of  verbs,  and  in  the  deictic  temporal  locatives 
such  as  “yesterday”,  “today”,  “tomorrow”,  “last  night”,  and  so  on.  The 
change  ontology  is  exhibited  in  most  verbs,  and  in  temporal  clausal  connec¬ 
tives.  The  universal  presence  in  natural  languages  of  both  classes  of  lexical 
items  and  grammatical  markers  requires  a  theory  that  can  accommodate 
both  ontologies,  illustrating  the  importance  of  methodological  principle  4. 

Among  temporal  connectives,  the  word  “while”  presents  interesting 
problems.  In  “ei  while  e2”,  e2  must  be  an  event  occurring  over  a  time 
interval;  ei  must  be  an  event  and  may  occur  either  at  a  point  or  over  an  in¬ 
terval.  One’s  first  guess  is  that  the  point  or  interval  for  ej  must  be  included 
in  the  interval  for  e2.  However,  there  are  cases,  such  a s 

It  rained  while  I  was  in  Philadelphia. 


or 


The  electricity  should  be  off  while  the  switch  is  being  repaired. 

which  suggest  the  reading  “e2  is  included  in  ei” .  We  came  to  the  conclusion 
that  one  can  infer  no  more  than  that  ei  and  e2  overlap,  and  any  tighter 
constraints  result  from  implicatures  from  background  knowledge. 

The  word  “immediately”  also  presents  a  number  of  problems.  It  requires 
its  argument  e  to  be  an  ordering  relation  between  two  entities  x  and  y  on 
some  scale  s. 
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immediate(e)  :  (3x,  y,s)/ess-than'(e,x,y,s) 

It  is  not  clear  what  the  constraints  on  the  scale  are.  Temporal  and  spatial 
scales  are  acceptable,  as  in  “immediately  after  the  alarm”  and  “immediately 
to  the  left” ,  but  the  size  scale  is  not: 

*  John  is  immediately  larger  than  Bill. 

Etymologically,  it  means  that  there  are  no  intermediate  entities  between  x 
and  y  on  s.  Thus, 

(Ve,x,  y,s)immediate(e)  A  less-than'(e,  x,  y,  5) 

D  ->(3  z)less-than[x,  z,s)  A  less-tkan(z,y,s) 

However,  this  will  only  work  if  we  restrict  z  to  be  a  relevant  entity.  For 
example,  in  the  sentence 

We  disengaged  the  compressor  immediately  after  the  alarm. 

the  implication  is  that  no  event  that  could  damage  the  compressor  oc¬ 
curred  between  the  alarm  and  the  disengagement,  since  the  text' is  about 
equipment  failure. 

3.4  Spaces  and  Dimension:  The  Minimal  Structure 

The  notion  of  dimension  has  been  made  precise  in  linear  algebra.  Since  the 
concept  of  a  region  is  used  metaphorically  as  well  as  in  the  spatial  sense, 
however,  we  were  concerned  to  determine  the  minimal  structure  a  system 
requires  for  it  to  make  sense  to  call  it  a  space  of  more  than  one  dimension. 
For  a  two-dimensional  space,  there  must  be  a  scale,  or  partial  ordering,  for 
each  dimension.  Moreover,  the  two  scales  must  be  independent,  in  that  the 
order  of  elements  on  one  scale  can  not  be  determined  from  their  order  on 
the  other.  Formally, 

(Vsp)space(sp)  = 

(3si,s2,<i,<2)-scale1(s1,sp)  A  scale2{s2,sp) 

A  order(<i,5i)  A  order(<2,s2) 

A  (3  x)(3yi)(x  <1  yi  A  X  <2  y^ 

A(3y2)(x  <!  y2  A  y2  <2  x) 
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Note  that  this  does  not  allow  <2  to  be  simply  the  reverse  of  <1.  An 
unsurprising  consequence  of  this  definition  is  that  the  minimal  example  of 
a  two-dimensional  space  consists  of  three  points  (three  points  determine  a 
plane),  e.g.,  the  points  A,  B,  and  C,  where 

A  <1  B ,  A  <1  C,  C  <2  A ,  A  <2  B. 

This  is  illustrated  in  Figure  1. 

The  dimensional  scales  are  apparently  found  in  all  natural  languages  in 
relevant  domains.  The  familiar  three-dimensional  space  of  common  sense  is 
defined  by  the  three  scale  pairs  “up-down” ,  “front-back” ,  and  “left-right” ; 
the  two-dimensional  plane  of  the  commonsense  conception  of  the  earth’s 
surface  is  represented  by  the  two  scale  pairs  “north-south”  and  “east-west” . 

The  simplest,  although  not  the  only,  way  to  define  adjacency  in  the 
space  is  as  adjacency  on  both  scales: 

(Vx,y,sp)adj(x,y,sp)  = 

(3si,s2)sca/ei(si,sp)  A  scale2(s2,  sp) 

A  adj(x,y,Si)  A  adj(x,y,s2) 

A  region  is  a  subset  of  a  space.  The  surface  and  interior  of  a  region  can 
be  defined  in  terms  of  adjacency,  in  a  manner  paralleling  the  definition  of 
a  boundary  in  point-set  topology.  In  the  following,  s  is  the  boundary  or 
surface  of  a  two-  or  three-dimensional  region  r  embedded  in  a  space  sp. 

(V  s,r)surf  ace(s,r,sp)  = 
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(Vx)x  G  r  D  [x  g  s  = 

{Ey){y  6  sp  A  ~>(y  G  r)  A  adj(x,y,  sp))] 

Finally,  we  can  define  the  notion  of  “context”  in  terms  of  points  in  different 
regions  being  adjacent: 

( Vri,  r2,sp)  contact  (ri,r2,sp)  = 
disjoint (ri, r2)  A 

(2?x,y)(x  G  n  A  y  G  r2  A  adj'(x,  y,  sp)) 

By  picking  the  scales  and  defining  adjacency  right,  we  can  talk  about 
points  of  contact  between  communication al  networks,  systems  of  knowl¬ 
edge,  and  other  metaphorical  domains.  By  picking  the  scales  to  be  the  real 
line  and  defining  adjacency  in  terms  of  e-neighborhoods,  we  get  Euclidean 
space  and  can  talk  about  contact  between  physical  objects. 

3.5  Material 

Physical  objects  and  materials  must  be  distinguished,  just  as  they  are  in 
apparently  every  natural  language,  by  means  of  the  count  noun  -  mass 
noun  distinction.  A  physical  object  is  not  a  bit  of  material,  but  rather  is 
comprised  of  a  bit  of  material  at  any  given  time.  Thus,  rivers  and  human 
bodies  are  physical  objects,  even  though  their  material  constitution  changes 
over  time.  This  distinction  also  allows  us  to  talk  about  an  object’s  losing 
material  through  wear  and  still  remaining  the  same  object. 

We  will  say  that  an  entity  6  is  a  bit  of  material  by  means  of  the  ex¬ 
pression  material(b).  Bits  of  material  are  characterized  by  both  extension 
and  cohesion.  The  primitive  predication  occupies(b,  r,  t)  encodes  extension, 
saying  that  a  bit  of  material  b  occupies  a  region  r  at  time  t.  The  topology 
of  a  bit  of  material  is  then  parasitic  on  the  topology  of  the  region  it  occu¬ 
pies.  A  part  bi  of  a  bit  of  material  b  is  a  bit  of  material  whose  occupied 
region  is  always  a  subregion  of  the  region  occupied  by  b.  Point-like  particles 
(particle)  are  defined  in  terms  of  points  in  the  occupied  region,  disjoint  bits 
( disjointbit )  in  terms  of  the  disjointness  of  regions,  and  contact  between 
bits  in  terms  of  contact  between  their  regions.  We  can  then  state  as  fol¬ 
lows  the  principle  of  non-joint-occupancy  that  two  bits  of  material  cannot 
occupy  the  same  place  at  the  same  time: 
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(V  bi,b2)(disjointbit(bi,b2) 

D  (Vz,  y,  63, 64) interior (63, &i) 

A  interior  (64,62)  A  particle(x,  63) 

Aparticle(y,  64) 

D  ~'(Ez)(at(z,  z)  A  at(y,z)) 

At  some  future  point  in  our  work,  this  may  emerge  as  a  consequence  of  a 
richer  theory  of  cohesion  and  force. 

The  cohesion  of  materials  is  also  a  primitive  property,  for  we  must  dis¬ 
tinguish  between  a  bump  on  the  surface  of  an  object  and  a  chip  merely 
lying  on  the  surface.  Cohesion  depends  on  a  primitive  relation  bond  be¬ 
tween  particles  of  material,  paralleling  the  role  of  adj  in  regions.  The 
relation  attacked  is  defined  as  the  transitive  closure  of  bond.  A  topology  of 
cohesion  is  built  up  in  a  manner  analogous  to  the  topology  of  regions.  In 
addition,  we  have  encoded  the  relation  that  bond  bears  to  motion,  i.e.  that 
bonded  bits  remain  adjacent  and  that  one  moves  when  the  other  does,  and 
the  relation  of  bond  to  force,  i.e.  that  there  is  a  characteristic  force  that 
breaks  a  bond  in  a  given  material. 

Different  materials  react  in  different  ways  to  forces  of  various  strengths. 
Materials  subjected  to  force  exhibit  or  fail  to  exhibit  several  invariance 
properties,  proposed  by  Hager  (1985).  If  the  material  is  shape-invariant 
with  respect  to  a  particular  force,  its  shape  remains  the  same.  If  it  is 
topologically  invariant,  particles  that  are  adjacent  remain  adjacent.  Shape 
invariance  implies  topological  invariance.  If  subjected  to  forces  of  a  certain 
strength  or  degree  <£1}  a  material  ceases  being  shape-invariant.  At  a  force 
of  strength  d 2  >  d\,  it  ceases  being  topologically  invariant,  and  at  a  force 
of  strength  d3  >  d2,  it  simply  breaks.  Metals  exhibit  the  full  range  of  pos¬ 
sibilities,  that  is,  0  <  d\  <  d2  <  ds  <  00.  For  forces  of  strength  d  <  d\,  the 
material  is  “hard”;  for  forces  of  strength  d  where  di  <  d  <  d2}  it  is  “flexi¬ 
ble”;  for  forces  of  strength  d  where  d2  <  d  <  d3,  it  is  “malleable”.  Words 
such  as  “ductile”  and  “elastic”  can  be  defined  in  terms  of  this  vocabulary, 
together  with  predicates  about  the  geometry  of  the  bit  of  material.  Words 
such  as  “brittle”  (d2  =  d2  =  d3)  and  “fluid”  ( d2  =  0,d3  =  00)  can  also 
be  defined  in  these  terms.  While  we  should  not  expect  to  be  able  to  de¬ 
fine  various  material  terms,  like  “metal”  and  “ceramic”,  we  can  certainly 
characterize  many  of  their  properties  with  this  vocabulary. 
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Because  of  its  invariance  properties,  material  interacts  with  containment 
and  motion.  The  word  “clog”  illustrates  this.  The  predicate  clog  is  a  three- 
place  relation:  z  clogs  y  against  the  flow  of  z.  It  is  the  obstruction  by  z 
of  z’s  motion  through  y,  but  with  the  selectional  restriction  that  z  must 
be  something  that  can  flow,  such  as  a  liquid,  gas,  or  powder.  If  a  rope  is 
passing  through  a  hole  in  a  board,  and  a  knot  in  the  rope  prevents  it  from 
going  through,  we  do  not  say  that  the  hole  is  clogged.  On  the  other  hand, 
there  do  not  seem  to  be  any  selectional  constraints  on  z.  In  particular, 
x  can  be  identical  with  z :  glue,  sand,  or  molasses  can  clog  a  passageway 
against  its  own  flow.  We  can  speak  of  clogging  where  the  obstruction  of 
flow  is  not  complete,  but  it  must  be  thought  of  as  “nearly”  complete. 

3.6  Other  Domains 

3.6.1  Causal  Connection 

Attachment  within  materials  is  one  variety  of  causal  connection.  In  general, 
if  two  entities  z  and  y  are  causally  connected  with  respect  to  some  behavior 
p  of  z,  then  whenever  p  happens  to  z,  there  is  some  corresponding  behavior 
q  that  happens  to  y.  In  the  case  of  attachment,  p  and  q  are  both  move.  A 
particularly  common  kind  of  causal  connection  between  two  entities  is  one 
mediated  by  the  motion  of  a  third  entity  from  one  to  the  other.  (This  might 
be  called  a  “vector  boson”  connection.)  Photons  mediating  the  connection 
between  the  sun  and  our  eyes,  raindrops  connecting  a  state  of  the  clouds 
with  the  wetness  of  our  skin  and  clothes,  a  virus  being  transmitted  from  one 
person  to  another,  and  utterances  passing  between  people  are  all  examples 
of  such  causal  connections.  Barriers,  openings,  and  penetration  are  all 
defined  with  respect  to  paths  of  causal  connection. 

3.6.2  Force 

The  concept  of  “force”  is  axiomatized,  in  a  way  consistent  with  Talmy’s 
treatment  (1985),  in  terms  of  the  predications  force(a,  b ,  di)  and  resist(b,  a,  d2) 
—  a  forces  against  b  with  strength  di  and  b  resists  a’s  action  with  strength 
d2.  We  can  infer  motion  from  facts  about  relative  strength.  This  treat¬ 
ment  can  also  be  specialized  to  Newtonian  force,  where  we  have  not  merely 
movement,  but  acceleration.  In  addition,  in  spaces  in  which  orientation  is 
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defined,  forces  can  have  an  orientation,  and  a  version  of  the  “parallelogram 
of  forces”  law  can  be  encoded.  Finally,  force  interacts  with  shape  in  ways 
characterized  by  words  like  “stretch”,  “compress”,  “bend”,  “twist”,  and 
“shear” . 

3.6.3  Systems  and  Functionality 

An  important  concept  is  the  notion  of  a  “system” ,  which  is  a  set  of  entities, 
a  set  of  their  properties,  and  a  set  of  relations  among  them.  A  common 
kind  of  system  is  one  in  which  the  entities  are  events  and  conditions  and 
the  relations  are  causal  and  enabling  relations.  A  mechanical  device  can 
be  described  as  such  a  system — in  a  sense,  in  terms  of  the  plan  it  executes 
in  its  operation.  The  function,  of  various  parts  and  of  conditions  of  those 
parts  is  then  the  role  they  play  in  this  system,  or  plan. 

The  intransitive  sense  of  “operate” ,  as  in 

The  diesel  was  operating. 

involves  systems  and  functionality.  If  an  entity  x  operates,  there  must  be 
a  larger  system  s  of  which  x  is  a  part.  The  entity  x  itself  is  a  system 
with  parts.  These  parts  undergo  normative  state  changes,  thereby  causing 
x  to  undergo  normative  state  changes,  thereby  causing  x  to  produce  an 
effect  with  a  normative  function  in  the  larger  system  s.  The  concept  of 
“normative”  is  discussed  below. 

3.6.4  Shape 

We  have  been  approaching  the  problem  of  characterizing  shape  from  a  num¬ 
ber  of  different  angles.  The  classical  treatment  of  shape  is  via  the  notion  of 
“similarity”  in  Euclidean  geometry,  and  in  Hilbert’s  formal  reconstruction 
of  Euclidean  geometry  (Hilbert,  1902)  the  key  primitive  concept  seems  to  be 
that  of  “congruent  angles”.  Therefore,  we  first  sought  to  develop  a  theory 
of  “orientation”.  The  shape  of  an  object  can  then  be  characterized  in  terms 
of  changes  in  orientation  of  a  tangent  as  one  moves  about  on  the  surface 
of  the  object,  as  is  done  in  vision  research  (e.g.,  Zahn  and  Roskies,  1972). 
In  all  of  this,  since  “shape”  can  be  used  loosely  and  metaphorically,  one 
question  we  are  asking  is  whether  some  minimal,  abstract  structure  can  be 
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found  in  which  the  notion  of  “shape"  makes  sense.  Consider,  for  instance, 
a  graph  in  which  one  scale  is  discrete,  or  even  unordered.  Accordingly,  we 
have  been  examining  a  number  of  examples,  asking  when  it  seems  right  to 
say  two  structures  have  different  shapes. 

We  have  also  examined  the  interactions  of  shape  and  functionality  (see 
Davis,  1984).  What  seems  to  be  crucial  is  how  the  shape  of  an  obstacle 
constrains  the  motion  of  a  substance  or  of  an  object  of  a  particular  shape 
(see  Shoham,  1985).  Thus,  a  funnel  concentrates  the  flow  of  a  liquid,  and 
similarly,  a  wedge  concentrates  force.  A  box  pushed  against  a  ridge  in  the 
floor  will  topple,  and  a  wheel  is  a  limiting  case  of  continuous  toppling. 

3.7  Hitting,  Abrasion,  Wear,  and  Related  Concepts 

For  x  to  hit  y  is  for  x  to  move  into  contact  with  y  with  some  force. 

The  basic  scenario  for  an  abrasive  event  is  that  there  is  an  impinging 
bit  of  material  m  that  hits  an  object  o  and  by  doing  so  removes  a  pointlike 
bit  of  material  60  from  the  surface  of  o: 

abr-event'(e,m,o,b0 )  :  material  (m) 

A  topologically-invariant{o ) 

(V  e,  m,  o,  bQ)abr-event'(e,  m,  o,  b0 )  = 

(3f,&,s,60,ei,e2,e3)af(e,f) 

A  consists-of(o,  b,  t)  A  surface(s,b ) 

Aparticle(b0,s )  A  change'{e,  e\,  e2) 

A  attacked' (e\,  bo,  b)  A  nof'(e2,ej) 

Acouse(e3,e)  A  hit'(es,m,b0 ) 

After  the  abrasive  event,  the  pointlike  bit  b0  is  no  longer  a  part  of  the  object 
o: 

(Ve,  m,  o,  b0,  ei ,  e2,  t2)abr-event' [e,  m,  o,  b0) 

A  ckange'(e,  elf  e2)  A  attached' (ei,b0,  b) 

A  not'(e2,e1)  A  af(e2,t2) 

A  consists-of  (o,  b2 ,  £2) 

D  ~>part{b0,b2) 

It  is  necessary  to  state  this  explicitly  since  objects  and  bits  of  material  can 
be  discontinuous. 
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An  abrasion  is  a  large  number  of  abrasive  events  widely  distributed 
through  some  nonpointlike  region  on  the  surface  of  an  object: 

(Ve,m,  o)abradel(e,m,o)  = 

(3  6s)[(Vei)[ei  60 

(3  b0)b0  G  6s  A  abr-event'(ei,m,o,  b0)] 

A  (V6,  s,f)[af(e,t) 

A  const sts-of(o,  b,  t)  A  surface(s,b ) 

D  (3r)subregion(r,s) 

A  widely-distributed(bs ,  r)]] 

Wear  can  result  from  a  large  collection  of  abrasive  events  distributed 
over  time  as  well  as  space  (so  that  there  may  be  no  instant  at  which  enough 
abrasive  events  occur  to  count  as  an  abrasion).  Thus,  the  link  between 
wear  and  abrasion  is  via  the  common  notion  of  abrasive  events,  not  via  a 
definition  of  wear  in  terms  of  abrasion. 

(Ve,m,o)u>ear'(e,x,o)  = 

(36s)(V e1)[ei  60 

(36o)60  €  6s)  A  abr-event'{ei,m,  o,  60)] 

A  (3  i)  [interval  (i)  A  widely-distributed(e,i)\ 

The  concept  “widely  distributed”  concerns  systems.  If  x  is  distributed 
in  y ,  then  y  is  a  system  and  x  is  a  set  of  entities  which  are  located  at 
components  of  y.  For  the  distribution  to  be  wide,  most  of  the  elements  of  a 
partition  of  y,  determined  independently  of  the  distribution,  must  contain 
components  which  have  elements  of  x  at  them. 

The  word  “wear”  is  one  of  a  large  class  of  other  events  involving  cu¬ 
mulative,  gradual  loss  of  material  -  events  described  by  words  like  “chip”, 
“corrode”,  “file”,  “erode”,  “rub”,  “sand”,  “grind”,  “weather”,  “rust”,  “tar¬ 
nish”,  “eat  away”,  “rot”,  and  “decay”.  All  of  these  lexical  items  can  now 
be  defined  as  variations  on  the  definition  of  “wear” ,  since  we  have  built  up 
the  axiomatizations  underlying  “wear” .  We  are  now  in  a  position  to  char¬ 
acterize  the  entire  class.  We  will  illustrate  this  by  defining  two  different 
types  of  variants  of  “wear”  -  “chip”  and  “corrode” . 

“Chip”  differs  from  “wear”  in  three  ways:  the  bit  of  material  removed  in 
one  abrasive  event  is  larger  (it  need  not  be  point-like),  it  need  not  happen 
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because  of  a  material  hitting  against  the  object,  and  “chip”  does  not  require 
(though  it  does  permit)  a  large  collection  of  such  events:  one  can  say  that 
some  object  is  chipped  if  there  is  only  one  chip  in  it.  Thus,  we  slightly  alter 
the  definition  of  abr-event  to  accommodate  these  changes: 

(Ve,m,o,  bo)chip'[e,m,o,b0)  = 

(Bt,6,s,6o,e1)e2,e3)af(e,i) 

Aconsists-of(o,  b,  t)  A  surface(s,b) 

Apart(b0,s)  A  change'{e,  eit  e2) 

A  attached' (ei,  bo,  b)  A  nof'(e2,e1) 

“Corrode”  differs  from  “wear”  in  that  the  bit  of  material  is  chemically 
transformed  as  well  as  being  detached  by  the  contact  event;  in  fact,  in 
some  way  the  chemical  transformation  causes  the  detachment.  This  can 
be  captured  by  adding  a  condition  to  the  abrasive  event  that  renders  it  a 
(single)  corrode  event: 

corrode-event(m,o,bo)  :  fluid(m ) 

A  contact  {m, bo ) 

(V  e,  m,  o,  &o)  corrode-event'(e,  m,  o,  60)  = 

(3f,6,s,&o5ei,e2,e3)at(e,t) 

A  consists-of(o,b,t)  A  surface(s,b) 

Aparticle(bo,s)  A  change' {e,e i,e2) 

A  attached' (eu  b0,  b)  A  nof'(e2,e i) 

Acause(e3,e)  A  chemical-change'{ez,m,bo) 

“Corrode”  itself  may  be  defined  in  a  parallel  fashion  to  “wear”,  by 
substituting  corrode-event  for  abr-event. 

All  of  this  suggests  the  generalization  that  abrasive  events,  chipping 
and  corrode  events  all  detach  the  bit  in  question,  and  that  we  may  describe 
all  of  these  as  detaching  events.  We  can  then  generalize  the  above  axiom 
about  abrasive  events  that  result  in  loss  of  material  to  the  following  axiom 
about  detaching: 

(V e,  m,  o,  bo,  &2,  ei,  e2,t2) detach' (e,m,  o,bo) 

A  change' (e,ei,  e2)  A  attached' (ei,b0,b) 

A  nof'(e2,  ei)  A  a£(e2,£2) 

A  const sts-of(o,  62 ,  £2) 

D  -i (part {b0,b2)) 
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4  Relevance  and  the  Normative 


Many  of  the  concepts  we  are  investigating  have  driven  us  inexorably  to  the 
problems  of  what  is  meant  by  “relevant”  and  by  “normative”.  We  do  not 
pretend  to  have  solved  these  problems.  But  for  each  of  these  concepts  we 
do  have  the  beginnings  of  an  account  that  can  play  a  role  in  analysis,  if  not 
yet  in  implementation. 

Our  view  of  relevance,  briefly  stated,  is  that  something  is  relevant  to 
some  goal  if  it  is  a  part  of  a  plan  to  achieve  that  goal.  (A  formal  treatment 
of  a  similar  view  is  given  in  Davies  and  Russell,  1986.)  We  can  illustrate 
this  with  an  example  involving  the  word  “sample” .  If  a  bit  of  material  x  is 
a  sample  of  another  bit  of  material  y,  then  re  is  a  part  of  y,  and  moreover, 
there  are  relevant  properties  p  and  q  such  that  it  is  believed  that  if  p  is  true 
of  x  then  q  is  true  of  y.  That  is,  looking  at  the  properties  of  the  sample 
tells  us  something  important  about  the  properties  of  the  whole.  Frequently, 
p  and  q  are  the  same  property.  In  our  target  texts,  the  following  sentence 
occurs: 

We  retained  an  oil  sample  for  future  inspection. 

The  oil  in  the  sample  is  a  part  of  the  total  lube  oil  in  the  lube  oil  system, 
and  it  is  believed  that  a  property  of  the  sample,  such  as  “contaminated 
with  metal  particles”,  will  be  true  of  all  the  lube  oil  as  well,  and  that  this 
will  provide  information  about  possible  wear  on  the  bearings.  It  is  therefore 
relevant  to  the  goal  of  maintaining  the  machinery  in  good  working  order. 

We  have  arrived  at  the  following  provisional  account  of  what  it  means  to 
be  “normative” .  For  an  entity  to  exhibit  a  normative  condition  or  behavior, 
it  must  first  of  all  be  a  component  of  a  larger  system.  This  system  has 
structure  in  the  form  of  relations  among  its  components.  A  pattern  is  a 
property  of  the  system,  namely,  the  property  of  a  subset  of  these  stuctural 
relations  holding.  A  norm  is  a  pattern  established  either  by  conventional 
stipulation  or  by  statistical  regularity.  An  entity  behaves  in  a  normative 
fashion  if  it  is  a  component  of  a  system  and  instantiates  a  norm  within 
that  system.  The  word  “operate”,  discussed  in  Section  3.6.3,  illustrates 
this.  When  we  say  that  an  engine  is  operating,  we  have  in  mind  a  larger 
system — i.e.,  the  device  the  engine  drives — to  which  the  engine  may  bear 
various  possible  relations.  A  subset  of  these  relations  is  stipulated  to  be 
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the  norm — the  way  it  is  supposed  to  work.  We  say  it  is  operating  when  it 
is  instantiating  this  norm. 


5  Conclusion 

The  research  we  have  been  engaged  in  has  forced  us  to  explicate  a  complex 
set  of  commonsense  concepts.  Since  we  have  done  it  in  as  general  a  fashion 
as  possible,  we  expect  to  be  able,  building  on  this  foundation,  to  axiomatize 
a  large  number  of  other  areas,  including  areas  unrelated  to  mechanical 
devices,  building  on  this  foundation.  The  very  fact  that  we  have  been 
able  to  characterize  words  as  diverse  as  “range” ,  “immediately” ,  “brittle” , 
“operate” ,  and  “wear”  shows  the  promising  nature  of  this  approach. 
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